Challenges of Inversely Estimating Jacobian from Metabolomics Data
نویسندگان
چکیده
Inferring dynamics of metabolic networks directly from metabolomics data provides a promising way to elucidate the underlying mechanisms of biological systems, as reported in our previous studies (Weckwerth, 2011; Sun and Weckwerth, 2012; Nägele et al., 2014) by a differential Jacobian approach. The Jacobian is solved from an overdetermined system of equations as JC + CJ(T) = -2D, called Lyapunov Equation in its generic form, where J is the Jacobian, C is the covariance matrix of metabolomics data, and D is the fluctuation matrix. Lyapunov Equation can be further simplified as the linear form Ax = b. Frequently, this linear equation system is ill-conditioned, i.e., a small variation in the right side b results in a big change in the solution x, thus making the solution unstable and error-prone. At the same time, inaccurate estimation of covariance matrix and uncertainties in the fluctuation matrix bring biases to the solution x. Here, we first reviewed common approaches to circumvent the ill-conditioned problems, including total least squares, Tikhonov regularization, and truncated singular value decomposition. Then, we benchmarked these methods on several in silico kinetic models with small to large perturbations on the covariance and fluctuation matrices. The results identified that the accuracy of the reverse Jacobian is mainly dependent on the condition number of A, the perturbation amplitude of C, and the stiffness of the kinetic models. Our research contributes a systematical comparison of methods to inversely solve Jacobian from metabolomics data.
منابع مشابه
Solving the Differential Biochemical Jacobian from Metabolomics Covariance Data
High-throughput molecular analysis has become an integral part in organismal systems biology. In contrast, due to a missing systematic linkage of the data with functional and predictive theoretical models of the underlying metabolic network the understanding of the resulting complex data sets is lacking far behind. Here, we present a biomathematical method addressing this problem by using metab...
متن کاملEnhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining
This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...
متن کاملProduction Data Analysis in a Gas-Condensate Field: Methodology, Challenges and Uncertainties
Production data analysis techniques have been widely used for estimating reservoir properties such as gas in place and average pressure. Knowledge of this information is crucial for proper reservoir management. The present study discusses the roadmap, challenges and uncertainties for analyzing production data in an Iranian gas-condensate offshore field. This work is an integrated study in...
متن کاملMarkov models for digraph panel data: Monte Carlo-based derivative estimation
A parametric, continuous-time Markov model for digraph panel data is considered. The parameter is estimated by the method of moments. A convenient method for estimating the variance–covariance matrix of the moment estimator relies on the delta method, requiring the Jacobian matrix—that is, the matrix of partial derivatives—of the estimating function. The Jacobian matrix was estimated hitherto b...
متن کاملMixed Linear Model Approaches of Association Mapping for Complex Traits Based on Omics Variants
Precise prediction for genetic architecture of complex traits is impeded by the limited understanding on genetic effects of complex traits, especially on gene-by-gene (GxG) and gene-by-environment (GxE) interaction. In the past decades, an explosion of high throughput technologies enables omics studies at multiple levels (such as genomics, transcriptomics, proteomics, and metabolomics). The ana...
متن کامل